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Simple proofs are given of the following classical theorems: (1) An arbitrary set of commuting 
matrices may be simultaneously brought to triangular form by a unitary similarity. (2) An arbitrary 
set of commuting normal matrices may be simultaneously brought to diagonal form by a unitary 
similarity. 
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1. Introduction 

The purpose of this note, which is expository, is 
to present proofs of two fundamental theorems on 
sets of commuting matrices. The theorems are classi- 
cal, but existing proofs tend to be unnecessarily com- 
plicated and furthermore are difficult to find in the 
literature. For these reasons the following simple 
proofs (along the lines set down by Frobenius and 
Schur in their original memoirs on group representa- 
tions) are of interest. In fact the basic tools are the 
concept of irreducibility and a simplified form of 
Schur's lemma, both from the theory of group repre- 
sentations. In addition some special information con- 
cerning normal matrices will be required, which we 
summarize briefly below. (Complete proofs may be 
found in MacDufTee's book.) 1 For simplicity, all the 
matrices considered below are over the complex field. 

A matrix A is normal if it commutes with A*, the 
conjugate transpose of A. A matrix U is unitary if 
U* = U~ 1 . Unitary matrices are themselves normal 
matrices, the unitary matrices form a group, and 
U*AU is normal if and only if A is normal. It is known 
that A is normal if and only if A = U*DU, where U is 
unitary and D diagonal. 

We also need 
Lemma 1. Let A be an arbitrary n X n complex matrix. 
Then there is a unitary matrix U and an upper tri- 
angular matrix T such that A = UT. 
PROOF. Let A x be the first column of A and suppose 
that A x 7^0. Put V l =A l /N(A l )(N{A l ) is the euclidean 
norm of A x ) and complete V\ by the Gram-Schmidt 
process to an orthonomal basis V\, V2, . . ., V n . 
Then the matrix V={V U V 2 , . . ., V n ) is unitary, and 
the first column of V*A is 



V*A~ 




~n{aS 


V%Ai 


= 





V*Ai m 








Define 



w- 



A,=0 



/f,^0. 



1 C. C. MacDuffee, The Theory of Matrices, New York, 1946. 



Then W is unitary, and the first column of W*A has 
all elements below the diagonal element 0. The proc- 
ess may now be repeated with the matrix W*A 
using unitary transformations of the form (1) + F, 
etc. We ultimately find that there is a unitary matrix 
U such that U*A = T is upper triangular. Hence 
A = UT, and the proof of the lemma is concluded. 
We also require the fact that if A is any matrix such 
that A A* has zero trace 2 then A must be the zero 
matrix. For if A = (ay) then 

tr(AA*)=2\aij\ 2 =N(A)\ 

and so tr(AA*) = implies A = 0. 

2. The Theorems and Their Proofs 

A set (finite or infinite) of n X n matrices 21 = {A} 
is said to be reducible if fixed positive integers p, q, 

2 The trace of a matrix A is the sum of its diagonal elements, and is a similarity invariant: 
i.e., remains unchanged under transformations of the form S~ X AS, where S is any nonsingu- 
lar matrix. 
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and a fixed nonsingular matrix S exist such that for 
each A e?l , 



(I) 



S-MS = | 



/An 







A 22j 



where A u is a pXp matrix, A V2 a pXq matrix and 
A 22 a qX q matrix. Otherwise 91 is said to be irreduc- 
ible. If the form (1) can be achieved with A V2 = as 
well for all AeW, then ?l is said to be fully reducible. 

Since any matrix is the product of a unitary by an 
upper triangular matrix (lemma 1) and a similarity 
transformation by an upper triangular matrix retains 
the block form of (1), the matrix S may be chosen 
unitary. From this remark it follows that ?l is re- 
ducible if and only if it it unitarily reducible, so that 
these are equivalent concepts. 

The basic lemma is the following: 
Lemma 2 (Schur's lemma, specialized). Let 91 = {A} be 
an irreducible set of n X n matrices, and let M be a 
fixed matrix such that for each Ae?l, there is a matrix 
A satisfying 

AM = MA. 

Then either M = 0orM is nonsingular. Furthermore if 
A = A (so that M commutes with each element of ?1) 
then M is scalar.''' 
PROOF. Suppose that the rank of M is r, and write 




where P, Q are nonsingular and I, is the rX r identity 
matrix. Then for each ^e^l. 



Thus A>\ = 0, an impossibility since 91 is irreducible. 
Hence r must be or n, and so either M = or M is 
nonsingular. This proves the first part of the lemma. 

Now suppose that M commutes with each element of 
91, and choose A as any eigenvalue of M. Then M — kl 
is singular and also commutes with each element of 
91. Hence M — A/ = 0, Af=X/, and the second part of 
the lemma is proved. 

We note that the lemma remains true if ?( is 
assumed unitarily irreducible. This remark will 
find application later. 

Now suppose that $\ = {A} is any set of n X n mat- 
rices. It is clear that after a suitable similarity has 
been performed, the matrices A may be taken so that 
with respect to some fixed partitioning, 

A=(A U ) 

where Aij = for &>/, and for each i the set 91* 
= {An} is irreducible. If we assume in addition that 
91 is a set of commuting matrices, then it follows that 
for each i 91 * is also a set of commuting matrices, and 
hence by lemma 2 that 91; consists entirely of scalar 
matrices. Hence we have proved the first of the two 
theorems: 

THEOREM 1. Let 91 = {A} be any set of commuting 
matrices. Then there is a fixed nonsingular matrix 
S (which may be chosen unitary) such that S _, AS is 
upper triangular for each Ac9l. 

Now let C?J = {A} be any set of n X n normal matrices. 
We first prove 

Lemma 3. If s 2l is unitarily reducible then it is uni- 
tarily fully reducible. 

PROOF. Suppose that ?l is unitarily reducible and let 
U be a unitary matrix such that with respect to some 
fixed partitioning, 



fir 



'(2) 



Put 



{P-'AP) 




0\ 



(QAQ 



'Au 


An 


A< 


A~ 


'A u 


i„ N 


In 


An 



P- l AP-- 



QAQ- 



where A u , Au_&re rXr matrices, A v >, A v >rX(n — r) 
matrices, A 2 \,A 2 \ (n — r) X r matrices and^ 2 2,^22(ft — r) 
X (n — r) matrices. Then (2) implies that 

(Au 0\ /A u A v \ 

Um 0/ \0 / 



U 'AU 



for each Ae%. 

Since normality is preserved under unitary similari- 
ties, the matrices U~ l AU are normal. Hence 

\ 




A* 




and it follows that 

Thus 

tr{AnA&)=0, 



aJ 



•Thai is. a multiple of th€ 



since AfiAn and A n A u have the same trace. By the 
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remarks made in the introduction, it follows that 
,4 12 = 0. This completes the proof of the lemma. 

From lemma 3 we obtain 
Lemma 4. There is a unitary matrix U such that for 
each A€?(, 



(3) 



U- 1 AU = A 11 -f A 22 +. . - + A r 



where the set ^i x = {Ay} is a unitarily irreducible set 
of normal matrices, 1 ^ i ^ r. 

Combining these lemmas, we obtain the second of the 
two theorems: 

THEOREM 2. Let s 2l = {A} be a set of commuting normal 
matrices. Then there is a fixed unitary matrix U such 
that U _1 AU is diagonal for each Ae?l. 
PROOF. Choose U so that the form (3) is achieved. 
Then the sets 21* = {An}, 1 ^ i ^ r are unitarily ir- 
reducible sets of commuting normal matrices. Lemma 
2 now implies that An is scalar for each^//e2lj, 1 ^ i ^ r 
from which the theorem follows. 



3. Consequences of the Theorems and a 
Problem 

These theorems have many important consequences, 
of which we mention two: 

(4) Let Aj, l^j^p be commuting nXn matrices 
and let /=/(#i, #2, . . • , x f >) be an arbitrary poly- 
nomial in Xi, $2, . . . , x p . Then there is a fixed order- 
ing of the eigenvalues of Aj, say Aj(l), M2), • • ., 
\}(n), l^j^p (which does not depend on f) such 
that the eigenvalues of f(A u A 2 , . . ., A p ) are pre- 
cisely /(MO , MO, • • ., MO), 1 ^i^w. 



(5) The irreducible representations of an abelian 
group are all of degree 1. 

There are also important applications in quantum 
mechanics, in the theory of the Hecke operators, and 
of course in group representations. 

The following problem has some interest: Give 
conditions for the simultaneous diagonalizability of 
a given set S J1 of commuting n X n matrices. One such 
criterion is furnished by Theorem 2. Another suffi- 
cient condition is that S ?I contain a diagonalizable non- 
derogatory matrix (one whose characteristic and min- 
imal polynomials coincide): for example, one with 
distinct eigenvalues. An inductive solution is as fol- 
lows: If s ?l consists entirely of scalar matrices, we arc 
through. If not, ?l must contain a diagonalizable non- 
scalar matrix B, and after a suitable similarity has 
been performed we may assume that 



(6; 



fi=A,/+A 2 /+. . .-f-A r J 



where r > 1 and A; = Aj if and only if i =j. Next (6) and 
the fact that the elements of §1 commute imply that 
if A is any element of 91 then 



A=Au+A»-> + , 



.+A rr 



where the partitioning is that imposed by the form B. 
The problem is now reduced to the study of the r com- 
muting sets 9l/ = {y4/j}, 1 ^ i ^ r, each of smaller di- 
mension than n, to which the procedure described 
above may be applied again, etc. The difficulty <>f 
course lies in recognizing when a given set of matrices 
contains a nonsealar diagonalizable element. 

(Paper 71 B2 & 3-201) 
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